Sensitivity to the Higgs Self-coupling Using the ZHH 
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The Standard Model predicts the value of the Higgs self-coupling but it cannot be 
measured at LHC. This measurement requires a machine such as the proposed Inter- 
national Linear Collider. Here, the sensitivity to the Higgs self-coupling is evaluated 
using the ZHH to six jets channel for a Higgs mass of 120 GeV/c 2 . Full simulation has 
been carried out for an integrated luminosity of 500 fb" 1 . Several analyses are pre- 
sented and all evaluate the cross section resolution to be about 180%. Potential areas 
for improvement are identified. 

1 Introduction 

At the energy of the International Linear Collider (ILC), the process e + e~ — > ZHH is the 
only one that can be used to measure the self-coupling of the Higgs boson. This study was 
performed assuming Mr = 120 GeV and a centre-of-mass energy of 500 GcV at which the 
total cross section is 0.183 fb, assuming a polarisation of -80% for the electron beam. The 
main decay mode is the six-jet final state with a BR of about 40%, which is the only final 
state considered in this analysis. The integrated luminosity was assumed to be 500 fb -1 
which corresponds to the planned integrated luminosity for the first phase of ILC. 

2 Generation, simulation and reconstruction 

The events used in the ZHH analysis were generated using Pandora Pythia [2] and WHIZARD 
[3]. The two generators were compared and are compatible. Since it is computationally im- 
possible to perform the simulation for the whole 500 fb -1 , only events with six quarks in 
the final state and a selection of four-jet final states were considered. Table [1] summarises 
the events generated. The main background is the hadronic tt channel which has a total 
cross section of 326 fb. For the ZZ channel it was required to have at least one Z decaying 
in heavy quarks (c, b). 

The detector simulation was performed using MOKKA v00-06-04p02. The detector 
model used was LDCOOSc g]. 

The simulated events were reconstructed using Marlin v00-09-10. The hits in the track- 
ing and calorimetry systems were digitised and then used as input for the tracking (FullLDC 
package) and particle flow reconstruction (PandoraPFA package) . The particles were forced 
to six jets using the Durham algorithm and the jets were analysed by the vertex recon- 
struction software (LCFI package) to reconstruct b and c vertices. The same reconstruction 
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Table 1 : Signal and principal background channels and number of events generated for each. 
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chain was performed using the "perfect" particle flow reconstruction, in which all particles 
are correctly reconstructed, essentially neglecting any confusion term from the calorimeters. 
Details of all the software used can be found in [5J. The reconstructed particles and the 
jets, for both realistic and perfect PFA chain of reconstruction, were then used to calculate 
several shape variables which were used in this analysis 



3 Cut based analysis 

A key variable used to separate signal from background was the sum of all outputs from the 
b tagging neural network. This is a number from to 1 for each jet, where 1 indicates b-like 
jets and light-like jets. Having 6 jets, the variable used in the analysis varies between and 
6 with the signal peaking at 4 and the main background at 2. The other variables used were: 
thrust, co&9 t f lrustl second Fox- Wolfram moment, total energy, number of tracks, number of 
particles in jet, angular distance between jets (Y6) and jet EM energy ratio. These variables 
were optimised by scanning simultaneously all variables over a wide range of values. However 
the high number of variables made it difficult to test many values because of the processing 
time and the memory requirement. A satisfactory compromise was found using five cut 
values for each variable and reiterating the process to find the exact maximum. For each 
iteration, among all possible combination of cuts (there are 5 s ), the one that maximised 
the usual figure of merit S/ V / STB was chosen. After few iterations the value of S/VS + B 
did not improve any further and the process was ended. The final value for S / VS + B after 
applying all the cuts was 0.364±0.011. A similar optimisation was performed for the perfect 
PFA reconstruction obtaining S/VS + B = 0.361 ± 0.010. 

In order to further separate signal from background, a \ 2 was built to force the recon- 
struction of each event to ZHH: 

a _ (Mjj - M z f (Mm - M H f (M mn - M H ) 2 

°Z °H °H 

All forty five combinations of the six jets were tried. The combination that produced the 
smallest x 2 defined the Xmin f° r tnat event - X m in was tnen use< ^ to discriminate signal from 
backgrounds. However the large number of tt passing the previous cuts can be reconstructed 
to look similar to ZHH events due to the high combinatorial in jet pairing. For this reason 
a second \ 2 was built adding the b tagging information: 
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The new term uses the b tag information with the value of A which has been found to 
be very large after an optimisation. Since the four jets from the two Higgs bosons should 
be b jets, the output of the b tagging neural network should peak at 1 hence the sum of 
the four terms should peak at zero for well reconstructed and well associated jets in signal 
events. Since b-like jets are more likely to form one of the Higgs boson instead of the Z, 
the new term effectively reduces the number of possible combinations. This reduction in 
combinations has a small impact on the signal but reduces all the backgrounds. 

The optimisation of the parameter A was performed varying the value from to 10 5 . For 
each value of A the same procedure described before was performed; the minimised \ 2 was 
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plotted for signal and background and from this distribution the S/yS + B was maximised. 
Figure [T] shows the maximum of S/VS + B as a function of A. The error is the statistical 
error mainly due to the limited number tt events. For values above 100 the separation is 
constant at a value of 0.55 ± 0.06. 

Kinematic fit of the six jets to fur- 
ther constrain the reconstruction was im- 
plemented but the separation was not im- 
proved. This and the fact that the opti- 
misation of A is asymptotic and does not 
have a peak, are indications that the mass 
resolution is less important than the b tag- 
ging performance. A separate analysis on 
the events with perfect PFA reconstruc- 
tion confirmed this indication. In fact the 
separation for the perfect reconstruction is 
S/VSTB = 0.59 ± 0.06, which, within the 
statistical error, is compatible with the case 
of realistic PFA. This means that any im- 
provement in the PFA will not reflect in a 
better separation in this analysis. In order 
to have a better separation the other main 
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Figure 1: S/vS + B as a function of A. 
element of the selection, the vertex reconstruction for the b tagging, has to be improved. 



4 Neural network analysis 

A second analysis was performed using a neural network, which in principle should give 
a better separation between the signal and the backgrounds then the cut based one. The 
network implementation was performed using the artificial neural network (ANN) package 
within TMVA [6]. A separate sample of signal and background events was generated to 
train the neural network. For the background, an integrated luminosity of 125 fb -1 was 
generated while for the signal 30000 events were used. The preliminary cuts described 
above were applied to the training sample and the events passing the cuts were used to 
train the network. Due to the limited number of events left, only a simple network could 
be trained; in particular two configurations were studied. The variables used were the b 
tagging, the Xzhh an< ^ the x%- The two % 2 variable are defined in Eq. Q] but the b jets 
were forced to came from the Higgs or from the decay of the top. 

The S/vS + B were obtained as before, looking for the maximum as a function of the 
neural network output; the results are summarised in Tabled Within the statistical error, 
neither of the two networks improved the separation between signal and background. This 
is a further confirmation that, at the moment, the mass information does not have any 
discriminating power. 



5 Conclusion 



Given the relevance of the b tagging performance in the analysis, a dedicated study was 
performed to study the performance in the six-jet environment. The efficiency in b tagging 
for b, c and light jets was compared between the two-jet and the six-jet environment. The 
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0.36 ±0.01 
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1364.5 


X 2 with b tag term 


0.55 ±0.06 
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47.0 


X 2 with b tag term and kin. fit. 


0.56 ±0.06 


6.4 


124.4 


NN two variables 


0.57 ±0.06 


5.8 


99.2 


NN three variables 


0.55 ±0.06 


7.5 


186.0 



comparison showed an increase of about 25% in the fake rate from c jets while the light jets 
in the six-jet environment had a fake rate doubled with respect to the two-jet environment. 
This increase is due to the different environment but also to the different energy of the jets. 
A separate study of two-jet events of different energy showed that the fake rate increases at 
higher energies with respect to the nominal value obtained with jets from the decay of a Z 
boson at rest. 

If a similar per- 
formance could be 
achieved in the six- 
jet environment as 
in the two-jet events, 
the resolution would 
improve by a fac- 
tor two. Then, 
without performing 
any further optimi- 
sation, the resolu- 
tion on the ZHH cross section would be about 95%. This value is not too distant from 
those obtained in fast Monte Carlo studies, about 80% for [7] and 60% for [8], if the same 
integrated luminosity is considered and if effects such as the gluon emission are considered. 
The remaining difference is likely to be due to detector effects, such as confusion in particle 
reconstruction. It is important to stress the fact that this is an indirect comparison; in order 
to have an accurate estimate of the differences between fast and full simulation, the same 
events should be compared using the same analysis. Further improvement could be achieved 
considering also the decay of the Z to neutrinos. 
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